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Abstract — Polar codes have attracted much recent attention as 
the first codes with low computational complexity that provably 
achieve optimal rate-regions for a large class of information- 
theoretic problems. One significant drawback, however, is that 
for current constructions the probability of error decays sub- 
exponentially in the block-length (more detailed designs improve 
the probability of error at the cost of significantly increased 
computational complexity |1|). In this work we show how the 
the classical idea of code concatenation - using "short" polar 
codes as inner codes and a "high-rate" Reed-Solomon code as 
the outer code - results in substantially improved performance. 
In particular, code concatenation with a careful choice of pa- 
rameters boosts the rate of decay of the probability of error to 
almost exponential in the block-length with essentially no loss 
in computational complexity. We demonstrate such performance 
improvements for three sets of information-theoretic problems 
- a classical point-to-point channel coding problem, a class of 
multiple-input multiple output channel coding problems, and 
some network source coding problems. 

I. Introduction 

Polar codes [2] are provably capacity-achieving codes for 
the Binary Symmetric Channel, with encoding and decoding 
complexities that scale as O(NlogN) in the block-length 
N. Polar codes have since demonstrated their versatility - 
capacity-achieving low-complexity schemes based on polar 
codes have been demonstrated for a wide variety of source and 
channel coding problems - for instance, some point-to-point 
discrete memoryless channels, some rate-distortion problems, 
the Wyner-Ziv problem and the Gelfand-Pinsker problem O . 

A significant drawback remains. Under the original defini- 
tion of polar codes their minimum distance can be shown J4] 
to grow no faster than —o(y/~N) in the block-length N. Hence 
the probability of error decays no faster than cxp(— o(y/~N)) 
(compared with the cxp(— Q(N)) probability of error provable 
for random codes 0). Low-complexity codes achieving this 
performance have been constructed 1 6 QJ. Further work to 
improve the decay-rate of the error probability was partially 
successful - a sequence of codes have been constructed [ 1 1 that 
in the limit (of a certain implicit parameter denoted I) achieves 
exp(— o(N)) probability of error; however this improvement 
comes at the expense of significantly increased computational 
complexity, which scales as 0(2 l N log N). 

In this work we demonstrate that concatenating short polar 
codes with a high-rate outer Reed-Solomon code significantly 
improves the rate of decay of the probability of error, with 
little or no cost in computational complexity. For the point-to- 
point channel coding problem we use capacity-achieving polar 
codes of block-length 9 (log 3 N) as the inner codes. 

1 The original work [ 2 ] only proved that the probability of error decayed as 
OiN- 1 / 4 ). 



There are three cases of interest. The first case is at one 
extreme, in which "many" of the inner codes (at least a 
log -3 / 2 N fraction) fail resulting in a decoding error with 
probability cxp(-ft(7V/(log" 27/8 N)) for our concatenated 
coding scheme. This is the only scenario in which our scheme 
decodes erroneously. 

The second scenario is at the other extreme, in which none 
of the inner codes fail. We show here that if the outer code is 
a systematic Reed-Solomon code with a rate that approaches 
1 asymptotically as a function of N, the decoder can quickly 
verify and decode to the correct output with computational 
complexity O (N (poly (log N))). We show further that the 
probability of this scenario occurring is 1 — o(l/N), hence 
this is the probabilistically dominant scenario. 

The third scenario is the intermediate regime in which at 
least one (but fewer than a log -3 / 2 N fraction) of the inner 
codes fail. Here we show that the Reed-Solomon outer code 
can correct the errors in the outputs of the inner codes. The 
complexity of decoding in this scenario 0(N 2 ) is domi- 
nated by the Berlekamp-Massey decoding algorithm for Reed- 
Solomon codes 0. However, since this scenario occurs with 
probability o(l/N), the average computational complexity is 
still dominated by the second scenario. 

We then extend these techniques to two other classes of 
problems. The first class is a general class of multiple-input 
multiple-output channels, which include as special cases the 
multiple-access channel and the degraded broadcast channel. 
The second class is that of network source coding, which 
includes as a special case the Slepian-Wolf problem. [8| Prior 
to polar codes no provable low-complexity capacity achieving 
schemes were known that achieved the optimal rates for these 
problems - in all cases our codes improve on the rate of decay 
of the probability of error that polar codes attain, while leaving 
other parameters of interest essentially unchanged. 

Our concatenated code constructions preserve the linearity 
of the polar codes they are built upon. This is because both 
the inner polar codes and the outer Reed-Solomon code have 
a linear encoding structure, albeit over different fields (F2 and 
W q respectively). However, if we choose the field for the outer 
code so that q = 2 r for some integer r, all linear operations 
required for encoding over ¥ q may be implemented asrxr 
matrix operations over F2. Hence the encoding procedures for 
both the inner and the outer code may be composed to form 
a code that is overall a linear code over F2. 

The remainder of the paper is as follows. Section HI] gives 
the relevant background of Reed-Solomon codes, polar codes, 
and concatenated codes respectively. The required notation for 
this work is summarized in Figure Q] Section [HI] describes 



our concatenated channel coding scheme, first for the point- 
to-point channel, and then for a general class of multiple- 
input multiple-output channels. Section HVl describes our cor- 
responding concatenated source coding scheme. 

II. Background 

A. Polar codes (2): are provably capacity-achieving codes 
with low encoding and decoding complexity for arbitrary 
binary-input symmetric discrete memoryless channels. To sim- 
plify presentation we focus on binary symmetric channel^] 1 8 1 
(BSC(p)) though many of the results can be generalized to 
other channels [3]. A crucial component of a polar code is 
a binary I x I "base matrix" denoted G (polar encoders and 
decoders depend on the matrix resulting when G is tensored 
with itself multiple times). We replicate here some important 
properties of polar codes relevant for this work. 

Let the inner code rate of the polar code be denoted by Rj 
. The polar encoder takes as input Rjn bits. The polar encoder 
outputs a sequence of n bits that are then transmitted over the 
channel. As the channel is a BSC(p), the channel flips each 
bit independently with a probability p. The polar decoder then 
attempts to reconstruct the encoder's Rjn bits. 
Probability of error: The best known rate of decay of the 
probability of error of polar codes with increasing block-length 
M (see GQ, 0, (6)) is (exp(-o(M^))). Here 0(1), called 
the exponent of the polar code is a function that is challenging 
to comput^l but for instance it is known that (3(2) = 0.5, 
/3(l) < 0.5 for I < 15, and (3(1) < 0.6 for I < 30. 
Complexity: Both the encoding and decoding complexity of 
polar codes scale as 0(2 nlogn). 

Rate: While an exact characterization of the speed of conver- 
gence of the rate to the Shannon capacity is not known, it 
is known that polar codes are asymptotically rate-optimal. In 
this work we denote the (unknown) rate redundancy of polar 
codes by S(n). 

We note that the rate of decay of the probability of error 
can be traded off with the computational complexity of polar 
codes. However, due to the exponential dependence of the 
computational complexity on the parameter I, this cost may 
be significant for codes that have an exponent close to 1. 
Other rate-optimal channel codes: There has been much at- 
tention on the excellent empirical performance (asymptotically 
capacity achieving, low encoding and decoding complexity, 
fast decay of the probability of error) of LDPC codes (9). 
However, as of now these results are still not backed up by 
theoretical guarantees. On the other side, the state of the art 
in provably good codes are those of Spielman et al. [10|, 
which are asymptotically optimal codes that have provably 
good performance in terms of computational complexity and 
probability of error. However, these codes too have their 
limitations - their computational complexity blows up as the 
rate of the code approaches capacity. 

2 Binary-input binary-output channels that flip each bit input to them 
according to a Bernoulli(p) distribution. 

3 Upper and lower bounds on the growth of /?(/) with I are known [T] - 
these bounds are again not in closed form and require significant computation. 



B. Reed-Solomon codes fill : (henceforth RS codes) are 
classical error-correcting codes. Let the outer code rate Ro 
of the RS code be any number in (0,1). The RS encoder 
takes as input Rom symbol^ over a finite field ¥ q (here 
the rate Ro, field-size q, and the outer code's block-length 
m are code design parameters to be specified later). The RS 
encoder outputs a sequence of m symbols over F 9 that are 
then transmitted over the channel. The encoding complexity 
of RS-codes is low - clever implementations are equivalent to 
performing a Fast Fourier Transform over ¥ q (henceforth ¥ q - 
FFT) JTJl. But this can be done with O(m\ogm) operations 
over F g , or equivalently O(mlogmlogg) binary operations 
(for large q). 

The channel is allowed to arbitrarily corrupt up to m(l — 
Ro)/2 symbols to any other symbols over ¥ q . 

For all sufficiently large q (in particular if q > m), the 
standard RS decoder Q can reconstruct the source's message 
with no error (as long as the channel has indeed corrupted 
fewer than m(l — Ro) /2 symbols). The fastest known RS de- 
coders for errors require 0(m 2 log m log q) binary operations 
(for large q) Q. 

In this work, we are interested in systematic RS codes 0. 
A systematic RS code is one in which the first Rom symbols 
are in fact the same as the input to the RS encoder, and 
the remaining (1 — Ro)m parity-check symbols correspond 
to the outputs of a generic RS decoder. These are used in 
our concatenated code constructions to give highly efficient 
decoding (0(m 2 log m log q) binary operations) of "high-rate" 
RS codes when, with high probability (w.h.p.) no errors occur. 
Details are in Section ITIT1 

C. Code concatenation |13|: proposed by Forney in 1966, 
means performing both encoding and decoding in two layers. 
The source information is first broken up into "many" chunks 
each of "small" size, and some redundancy is added via an 
outer code. Then each chunk is encoded via a separate inner 
code. The result is the transmitted codeword. 

The power of the technique lies in the complementary nature 
of the two layers. It turns out that the while the probability of 
error of the inner code decays slowly due to the small block- 
length of each chunk, its encoding and decoding complexities 
are also low for the same reason. In contrast, since the outer 
code has many chunks to code over it can achieve a fast 
rate of convergence of the probability of error. However, 
since the outer code is over a larger alphabet, algebraic codes 
with efficient encoding and decoding techniques (such as RS 
codes) can be employed. Combining the two layers with the 
appropriate choice of parameters results in an overall code with 
low encoding and decoding complexity, and also a reasonably 
fast decay of probability of error. 

In the context of channel coding, we summarize our notation 
in Figure Q] A detailed description of the corresponding code 
concatenation scheme is given in Section IllTl 

4 As is standard we assume here that m / Ro is an integer - if not, choosing 
a large enough m allows one to choose R'q Ro resulting in codes with 
approximately the same behaviour in all the parameters of interest. 



Parameter 


Meaning 


Our parameter choice 


11 


Block-length of inner codes (over F2) 


\og A N 


Ri 


Rate of inner codes 


1 - H b (p) - 5(\og 3 N) 


Pt 


Probability of error of inner codes 


cxp(-f7(log 9/8 A)) 


q = 2 R ' n 


Field-size of outer code 


2Ri log 3 N 


m 


Block-length of outer code (over ¥ q ) 


A log -3 A 


Ho 


Kate of outer code 


1 Ol ■ — 3/2 AT 

1 — 2 log ' iv 


N = nm 


Block-length of overall code (over F2) 


A 


M = RrfoN 


Number of source bits 


(l-H b (p)- <5(log 3 A)) (l - 2 log" 3 / 2 N\ A 


Pe 


Probability of error of overall code 


cxp{-n(N\og- 27/8 A)) 



Fig. 1. Summary of notation for the binary symmetric channel 



III. Channel coding 

For ease of exposition we first outline our main ideas for 
a point-to-point channel. In particular, we focus on the binary 
symmetric channel BSC(p), though our results can directly be 
extended to more general scenarios. 

As is well-known, the optimal rate achievable asymptoti- 
cally for large block-length for a BSC(p) equals 1 — Hb(p), 
where H b (.) refers to the binary entropy function (8]. 

A. Binary symmetric channel 

For each A we construct our codes as follows. 

1 ) Encoder: Let n — log 3 A be the block-length of the 
inner polar codes, R j = 1 — H b {p) — 8{n) = 1 — H b {p) — 
8(\og 3 N) be their rate, and pi = exp(-ft(log 3/3 A)) be 
their probability of error. (Here f3 is any value in (1/3, 1/2). 
The upper bound arises due to the provable rate of decay 
of the probability of error of polar codes |3|, and the lower 
bound arises due to a technical condition required in (fl~|i. For 
concreteness we set /3 = 3/8.) Let fj : ¥ R ' n :-s> F£ and 
gi : Fg :— >■ ¥ Rin denote respectively the encoder and decoder 
of an inner polar code. 

Correspondingly, let m = N / n = N / log 3 N be the block- 
length of the outer systematic RS code, q = 2 R ' n = 2 Rl log3 N 
be the field-sizeH and R = 1-2 log _3/2 N be its rate 
(so that the corresponding RS decoder can corrupt up to a 
fraction log _3/2 N of symbol errors). Let f : ¥ Rom -> F™ 
and go '■ F^ — > F R ° m denote respectively the encoder 
and (Berlekamp-Massey) decoder for the outer systematic RS 
code. 

Let M = R0R1N . Define the concatenated code through 
the encoder function / : W^ 1 — s- such that for each source 
message u M e {0, 1} M , 

/(u M )-(/ / (xi),//(x 2 ),...,/ / (x„ l )) ; 

where for each i in {1, 2, ... , m}, Xj represents the ith symbol 
of the output of the outer systematic RS encoder /o(u M ), 
viewed as a length-i?/n bit vector. 

As noted in the introduction, since the inner code is linear 
over Fa, and the outer code is linear over a field whose size 

5 For this choice of parameters q = ui{m), as required for RS codes. 



is a power of 2 (and as such may be implemented via matrix 
operations over F2) the overall encoding operation is linear. 

2) Channel: The channel corrupts each transmitted inner 
code vector Xj to y i7 resulting in the output bit-vector y N . 

3) Decoder: The concatenated polar code decoder proceeds 
as follows: 

1) It decodes each successive n bit- vector yi , y2, • • • , y m 
using the inner polar code decoders g/s to the length-i?/n 
bit vectors xi,x 2 , . . . ,x m = pj(yi), Sj(ya), • • ■,9i(y m )- 

2) It passes the first Rom outputs xi,xa,. .. ,XR om of the 
inner code decoders through the systematic RS encoder fo- 

3) If /o(xi,x 2 , . . . , XRom) = xi, x 2 , . . . , x m then it declares 
xi,ya, • • • )Xfl oTO as the decoded message (denoted by x M ) 
and terminates. 

4) Otherwise it passes yx , y 2 , • • • , y m through the outer de- 
coder go (a standard RS Berlekamp-Massey decoder), de- 
clares the length-Af bit-vector corresponding to the output 
xj , x|, . . . , x* Rom = ffo(xi, x 2 , . . . , x m ) as the decoded mes- 
sage (denoted by x M ), and terminates. 

The rationale for this decoding algorithm is as follows. Step 
1 uses the m inner codes to attempt to correct the errors in each 
symbol of the outer code. If each resulting symbol is indeed 
error-free, then since the outer code is a systematic RS code, 
re-encoding the first Rom symbols (Step 2) should result in 
the observed decoder output (Step 3). On the other hand, if the 
inner codes do not succeed in correcting all symbols for the 
outer code, but there fewer than (l — Ro)m/2 = N\og~ 9 ^ 2 N 
errors in these m — N log" 3 N symbols, then the RS outer 
decoder succeeds in corrected all the outer code symbols (Step 
4). Hence an error occurs only if there are N log -9 / 2 N or 
more errors in the outer code. The probability of this event 
can be bounded from above by the following theorem. 

Theorem 1. For the concatenated polar code defined above, 
for all N large enough, P e < exp(-6(A r /log 27/8 AT)). 

Proof: By the polar code exponent of J6], our specific 
choice of /3 — 3/8 in the concatenated polar encoder, and for 
n large enough, the probability that any specific inner code 
fails is at most 

Pr( g/ ( Xl ) ^ x 4 ) < exp(-n' 3 ) = exp(- log 9/8 N). 



As noted above our code construction fails only if 
Alog~ 9 ^ 2 A or more of the m = A log -3 A inner codes 
fail. Hence the probability of error 

* eXP (io^^(io^))( eXP (-^g^ 

where the second inequality is due to Stirling's approxima- 
tion |8|. Next we note that for lim e _>.o Hf,(e)/e a — for every 
a € [0, 1). In particular, by choosing a = 1/2, we obtain, 

P e < exp (A(log- 3 A) (log~ 3/4 A^j - ^A log~ 27/8 a) ) 

< exp(9(Alog" 27/8 A)), 

for large enough A. Finally, we see that this construction is 
capacity achieving since the inner codes and outer code are 
constructed at rates asymptotically (in A) approaching channel 
capacity and 1 respectively. ■ 
Notice here that with the above choice of parameters n and 
to, the expected number of errors in the received codeword 
for the outer codeword approaches zero. Therefore, with a 
high probability, we receive the transmitted codeword without 
any error. We exploit this fact in showing that the average 
complexity of the decoding algorithm is dominated by the 
complexity of the verification step. This is characterized in 
the following result. 

Theorem 2. With the choice of parameters as in Theorem\l\ 
the above algorithm runs in O(AlogA) time. 

Proof: Our inner decoder decodes each of the 
6 (A/ log 3 A) inner codes using the standard polar code 
decoder. By the standard polar code successive cancellation 
decoding procedure J3, the computational complexity of 
decoding each inner code is C(log 3 A log(log 3 A)), which 
equals (log 3 A log log A)). Since there are 9(A/log 3 A) 
inner codes, the overall decoding complexity of the inner code 
decoders is O (A log log A)). This is dominated by the next 
decoding step, and hence we neglect this. 

We note that for our code construction the average decoding 
complexity of a systematic RS code can be reduced so as to ap- 
proximately equal its encoding complexity (O(m\ogm\ogq) 
binary operations), as follows. 

Recall our outer decoder follows the following procedure. 
It first performs the encoding operation again on the first R m 
symbols and compares the output of this second encoding 
process with the observed m symbols. If these two sequences 
are indeed the same the decoder outputs the first R m symbols 
as the decoded output and terminates. If not the decoder then 
defaults to standard RS decoding [7| to output R m symbols 
as the decoded output and terminates. 

Let Pi denote the probability that at least one subblock has 
been decoded erroneously by the polar decoder. Since Pi < 
mexp(-n 3 / 8 ) for our choice of /3 = 3/8, P\ decays as 

Pi < exp(C((logA) 9/8 ))(log~ 3 A) =o(l/m). (1) 



We now consider the complexity of this decoder for the 
three scenarios for the outer code. 

1) The channel for the outer code corrupts no symbols, 
with probability at least 1 — o(l/m): In this case the 
decoder terminates after the first step, having performed 
v O(m\ogm\ogq) elementary operations. 
] j 2) The channel for the outer code corrupts fewer than 
' m(l — i? )/2 symbols, with probability at most 1 — 
o(l/m): In this case the decoder terminates af- 
ter the second step, having performed an additional 
0(m 2 log to log g) elementary operations. 
3) The channel for the outer code corrupts m(l — 
i?o)/2 or more symbols, with probability at most 
exp{— f2(— A log 27 / 8 A)): This case is an alternative 
for the previous scenario, and also terminates after an 
additional 0(m 2 log to log q) elementary operations. 

Thus the overall average computational complexity is 
0{rn log to log q). Recalling that for our choice of parameters 
to = 9 (A/ log 3 A) and q = cxp(9(log 3 A)), this gives an 
overall complexity of C(Alog A). ■ 
Observe that concatenation essentially operates on the mes- 
sage bits and is independent of the channel under consider- 
ation. Therefore, if each transmitter has more than one kind 
of messages, like in the case of a multiple-input multiple- 
output channel, the operation outlined above can be repeated 
for each message separately. Likewise, at each decoder, the 
above decoding rule may be applied for each message after 
decoding the inner code. Since polar codes have been shown 
to be optimal for certain multiuser channels (3J, and similar 
results on the rate of polarization continue to hold, we have 
the following corollary: 

Corollary 1 (Multiple Access Channel and Degraded Broad- 
cast Channel). For the two user multiple access channel 
Py\x 1 x 2 ('\') an d the degraded broadcast channel Px 1 x 2 \Y> 
the code constructed by concatenating a polar code of block 
length log 3 A with a RS code of block length A log 3 A has 
an error probability that decays as exp(— r2(Alog~ 27 ^ 8 A)) 
and can be encoded and decoded in 0(N log A). 

IV. Code Design for Network Source Coding 

The results in previous sections demonstrate that concate- 
nation is helpful in reducing the error probability for channel 
coding. In this section, we show that the error probability 
for network source coding may be similarly reduced via 
concatenation. 

A network Af = (V,E) is a directed graph. Denote by 
S C V and T C V, respectively, the sets of source and sink 
nodes. Each source node s € S observes a source random 
process U(s), and each sink node t 6 T demands a subset 
W(t) = (U(s) : s £ S t ) of these for some S t C S. The 
random process {(Ui(s) : s £ S)}^1 1 is drawn i.i.d. from a 
known probability mass function P(-). In our discussion, we 
consider only binary sources. It can be seen that the technique 
outlined here extend to non-binary sources as well. 



For a collection of rates (R e > : e € E), an 
(n, (2 nRe ) ee E) network source code (fW,G™) comprises 
of encoder maps 

F (n) = (j e («) : e & E) and decoder maps 

Q(n) _ ( fl (n) ;i g wkh 

/(St) : II n R '^K R{ ^{v,v')&Y {V\S) 

g[ n) : Y[ ¥% Re -> ]J V* e T. 
eei\(t) ses t 

The probability of error for the above code is defined as 

P e 4 Pr( 3t (n) (/ e : e e ^(t)) ^ Wr(t), W n (t) for some i). 

A. Concatenated Code Construction 

As in the previous constructions, fix n = log 3 M and 
m = M log -3 M to be the input block-lengths for the inner 
and outer codes. The outer code is chosen to be a systematic 
RS code. The concatenated code construction for this case is 
similar to that for channel coding, except for a few differences. 
For an input block length M = nm, the code (T (M) ,G (M) ) 
is constructed as follows. 

1) Encoder: Let (F^.G'")) be a code that operates at 
a rate (R e : e e E) and . Let R = (1 - 2/n 4/3 / 3 ). Let 
hi = M/(nRo) be the codelength for the outer systematic 
RS code. Denote the outer code as f„ : F™ flo -> F™ . Let g a 
be the corresponding decoder. Note that since f a is chosen to 
be a systematic code, 

f (u M (s)) = u M (s)*f (u M (s)) 

for some function f Q . The encoding is performed in two steps: 

a) Apply the code (F^ n \G^) to each sub-block 
u ("-i)«+i( 5 ) for i = 1, . . .^m. 

b) For each s E S, multicast f (u MR °(s)) to each sink t 
such that s £ St- The excess rate added on each edge as 
a result of this transmission is upper bounded by | S\ (1 — 
Ro)/2. 

2) Decoder: Since each sink receives the redundant bits 
for the outer code for every desired source, as well as the 
codewords from the inner code, decoding can be performed 
in the following two steps: 

a) Apply the decoder functions to each sub-block sep- 
arately to obtain reconstructions w M (t) = (u M (s,t) : 

s e S t ). 

b) Output the reconstruction u (s,i) = 
g°(u M (s,t)Jz(u M (s)). 

The above construction technique is independent of the 
choice of the inner code. For specific networks such as 
Slepian-Wolf network and Ahlswede-Korner network etc, po- 
lar codes are shown to be optimal [3|. Further, the error 
probability is shown to vanish as exp (— n°) for (3 < 1/2. 
Since the length of the inner code and the outer code, and the 
rate of the outer code have been chosen in the same way as 



the concatenated channel code construction, as a corollary of 
Theorems Q] and [2] we have the following result: 

Corollary 2. Let M be a network such that there exists a 
sequence of polar codes {(F^ n \ G^)}^! that are optimal. 
Then, the sequence of concatenated codes outlined above 
are also optimal. Further, the error probability decays as 
exp(— VL(M log -27 / 8 M)) and the encoding, and decoding 
can be performed in Q(M logM) time. 

V. Conclusion 

In this work we examine the question of the tradeoff 
between the computational complexity of polar codes and the 
rate of decay of their probability of error. We demonstrate that 
using the well-studied technique of concatenation, the rate of 
decay of the probability of error can be boosted to essentially 
optimal performance. The question of the corresponding speed 
of convergence of the code rates to the optimal rate-region 
is still an interesting open question, as it is for the original 
formulation of polar codes. 
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